Search results for "Statistics - Machine Learning"

showing 10 items of 90 documents

Approximation of functions over manifolds : A Moving Least-Squares approach

2021

We present an algorithm for approximating a function defined over a $d$-dimensional manifold utilizing only noisy function values at locations sampled from the manifold with noise. To produce the approximation we do not require any knowledge regarding the manifold other than its dimension $d$. We use the Manifold Moving Least-Squares approach of (Sober and Levin 2016) to reconstruct the atlas of charts and the approximation is built on-top of those charts. The resulting approximant is shown to be a function defined over a neighborhood of a manifold, approximating the originally sampled manifold. In other words, given a new point, located near the manifold, the approximation can be evaluated…

Computational Geometry (cs.CG)FOS: Computer and information sciencesComputer Science - Machine LearningClosed manifolddimension reductionMachine Learning (stat.ML)010103 numerical & computational mathematicsComplex dimensionTopology01 natural sciencesMachine Learning (cs.LG)Volume formComputer Science - GraphicsStatistics - Machine Learningmanifold learningApplied mathematics0101 mathematicsfunktiotMathematicsManifold alignmentAtlas (topology)Applied Mathematicshigh dimensional approximationManifoldGraphics (cs.GR)Statistical manifold010101 applied mathematicsregression over manifoldsComputational Mathematicsout-of-sample extensionComputer Science - Computational Geometrynumeerinen analyysimonistotapproksimointimoving least-squaresCenter manifold
researchProduct

Predicting overweight and obesity in later life from childhood data: A review of predictive modeling approaches

2019

Background: Overweight and obesity are an increasing phenomenon worldwide. Predicting future overweight or obesity early in the childhood reliably could enable a successful intervention by experts. While a lot of research has been done using explanatory modeling methods, capability of machine learning, and predictive modeling, in particular, remain mainly unexplored. In predictive modeling models are validated with previously unseen examples, giving a more accurate estimate of their performance and generalization ability in real-life scenarios. Objective: To find and review existing overweight or obesity research from the perspective of employing childhood data and predictive modeling metho…

Computer Science - Machine LearningStatistics - Machine LearningStatistics - Applications
researchProduct

Distributed Real-Time Sentiment Analysis for Big Data Social Streams

2014

Big data trend has enforced the data-centric systems to have continuous fast data streams. In recent years, real-time analytics on stream data has formed into a new research field, which aims to answer queries about "what-is-happening-now" with a negligible delay. The real challenge with real-time stream data processing is that it is impossible to store instances of data, and therefore online analytical algorithms are utilized. To perform real-time analytics, pre-processing of data should be performed in a way that only a short summary of stream is stored in main memory. In addition, due to high speed of arrival, average processing time for each instance of data should be in such a way that…

Data streamFOS: Computer and information sciencesComputer Science - Computation and LanguageComputer sciencebusiness.industryData stream miningSentiment analysisBig dataMachine Learning (stat.ML)Databases (cs.DB)Data structurecomputer.software_genreField (computer science)Computer Science - Information RetrievalTree (data structure)Computer Science - DatabasesComputer Science - Distributed Parallel and Cluster ComputingAnalyticsStatistics - Machine LearningData miningDistributed Parallel and Cluster Computing (cs.DC)businesscomputerComputation and Language (cs.CL)Information Retrieval (cs.IR)
researchProduct

Remote Sensing Image Classification with Large Scale Gaussian Processes

2017

Current remote sensing image classification problems have to deal with an unprecedented amount of heterogeneous and complex data sources. Upcoming missions will soon provide large data streams that will make land cover/use classification difficult. Machine learning classifiers can help at this, and many methods are currently available. A popular kernel classifier is the Gaussian process classifier (GPC), since it approaches the classification problem with a solid probabilistic treatment, thus yielding confidence intervals for the predictions as well as very competitive results to state-of-the-art neural networks and support vector machines. However, its computational cost is prohibitive for…

FOS: Computer and information sciences010504 meteorology & atmospheric sciencesComputer scienceMultispectral image0211 other engineering and technologiesMachine Learning (stat.ML)02 engineering and technologyLand cover01 natural sciencesStatistics - ApplicationsMachine Learning (cs.LG)Kernel (linear algebra)Bayes' theoremsymbols.namesakeStatistics - Machine LearningApplications (stat.AP)Electrical and Electronic EngineeringGaussian process021101 geological & geomatics engineering0105 earth and related environmental sciencesRemote sensingContextual image classificationArtificial neural networkData stream miningProbabilistic logicSupport vector machineComputer Science - LearningKernel (image processing)symbolsGeneral Earth and Planetary Sciences
researchProduct

Automatic image-based identification and biomass estimation of invertebrates

2020

1. Understanding how biological communities respond to environmental changes is a key challenge in ecology and ecosystem management. The apparent decline of insect populations necessitates more biomonitoring but the time-consuming sorting and expert-based identification of taxa pose strong limitations on how many insect samples can be processed. In turn, this affects the scale of efforts to map and monitor invertebrate diversity altogether. Given recent advances in computer vision, we propose to enhance the standard human expert-based identification approach involving manual sorting and identification with an automatic image-based technology. 2. We describe a robot-enabled image-based ident…

FOS: Computer and information sciences0106 biological sciencesclassification (action)Computer Science - Machine Learninghahmontunnistus (tietotekniikka)Computer scienceImage qualityComputer Vision and Pattern Recognition (cs.CV)Computer Science - Computer Vision and Pattern Recognitionclassificationsmodelling (creation related to information)neuroverkot01 natural sciencesConvolutional neural networkcomputer visionMachine Learning (cs.LG)remote sensingAbundance (ecology)Statistics - Machine Learningkonenäköinsectstunnistaminenbiodiversitysystematiikka (biologia)Ecological ModelingSortingselkärangattomatneural networksmuutosjohtaminenautomated pattern recognitionIdentification (information)machine learningkoneoppiminenclassificationEcosystem managementhämähäkitrecognitionmallintaminenneural networks (information technology)Machine Learning (stat.ML)010603 evolutionary biologyspidersidentifiointilajitsystematicsluokituksetEcology Evolution Behavior and Systematicsluokitus (toiminta)tarkkuusbusiness.industry010604 marine biology & hydrobiologyDeep learningPattern recognitiontypes and speciesidentification (recognition)15. Life on land113 Computer and information sciencesecosystems (ecology)invertebratesbiodiversiteettiekosysteemit (ekologia)hyönteisetidentificationprecisionkaukokartoitusArtificial intelligencechange management (leadership)businessScale (map)
researchProduct

Flood Detection On Low Cost Orbital Hardware

2019

Satellite imaging is a critical technology for monitoring and responding to natural disasters such as flooding. Despite the capabilities of modern satellites, there is still much to be desired from the perspective of first response organisations like UNICEF. Two main challenges are rapid access to data, and the ability to automatically identify flooded regions in images. We describe a prototypical flood segmentation system, identifying cloud, water and land, that could be deployed on a constellation of small satellites, performing processing on board to reduce downlink bandwidth by 2 orders of magnitude. We target PhiSat-1, part of the FSSCAT mission, which is planned to be launched by the …

FOS: Computer and information sciences: Computer science [C05] [Engineering computing & technology]Computer Science - Machine LearningImage and Video Processing (eess.IV): Multidisciplinary general & others [C99] [Engineering computing & technology]Machine Learning (stat.ML)Image and Video ProcessingElectrical Engineering and Systems Science - Image and Video Processing: Sciences informatiques [C05] [Ingénierie informatique & technologie]Machine Learning (cs.LG)Machine Learning: Multidisciplinaire généralités & autres [C99] [Ingénierie informatique & technologie]Artificial IntelligenceStatistics - Machine LearningSmall SatellitesFOS: Electrical engineering electronic engineering information engineeringFlood detectionEarth Observation: Aerospace & aeronautics engineering [C01] [Engineering computing & technology]: Ingénierie aérospatiale [C01] [Ingénierie informatique & technologie]
researchProduct

Explaining the unique nature of individual gait patterns with deep learning

2019

Machine learning (ML) techniques such as (deep) artificial neural networks (DNN) are solving very successfully a plethora of tasks and provide new predictive models for complex physical, chemical, biological and social systems. However, in most cases this comes with the disadvantage of acting as a black box, rarely providing information about what made them arrive at a particular prediction. This black box aspect of ML techniques can be problematic especially in medical diagnoses, so far hampering a clinical acceptance. The present paper studies the uniqueness of individual gait patterns in clinical biomechanics using DNNs. By attributing portions of the model predictions back to the input …

FOS: Computer and information sciencesAdultMaleComputer Science - Machine Learninglcsh:Rlcsh:MedicineMachine Learning (stat.ML)Healthy VolunteersArticleMachine Learning (cs.LG)Biomechanical PhenomenaYoung AdultDeep LearningStatistics - Machine LearningHumanslcsh:QFemale000 Allgemeineslcsh:ScienceGait000 Generalities
researchProduct

Nonlinearities and Adaptation of Color Vision from Sequential Principal Curves Analysis

2016

Mechanisms of human color vision are characterized by two phenomenological aspects: the system is nonlinear and adaptive to changing environments. Conventional attempts to derive these features from statistics use separate arguments for each aspect. The few statistical explanations that do consider both phenomena simultaneously follow parametric formulations based on empirical models. Therefore, it may be argued that the behavior does not come directly from the color statistics but from the convenient functional form adopted. In addition, many times the whole statistical analysis is based on simplified databases that disregard relevant physical effects in the input signal, as, for instance…

FOS: Computer and information sciencesColor visionComputer scienceCognitive NeuroscienceComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISIONStandard illuminantMachine Learning (stat.ML)Models BiologicalArts and Humanities (miscellaneous)Statistics - Machine LearningPsychophysicsHumansLearningComputer SimulationChromatic scaleParametric statisticsPrincipal Component AnalysisColor VisionNonlinear dimensionality reductionAdaptation PhysiologicalNonlinear systemNonlinear DynamicsFOS: Biological sciencesQuantitative Biology - Neurons and CognitionMetric (mathematics)A priori and a posterioriNeurons and Cognition (q-bio.NC)AlgorithmColor PerceptionPhotic Stimulation
researchProduct

Optimized Kernel Entropy Components

2016

This work addresses two main issues of the standard Kernel Entropy Component Analysis (KECA) algorithm: the optimization of the kernel decomposition and the optimization of the Gaussian kernel parameter. KECA roughly reduces to a sorting of the importance of kernel eigenvectors by entropy instead of by variance as in Kernel Principal Components Analysis. In this work, we propose an extension of the KECA method, named Optimized KECA (OKECA), that directly extracts the optimal features retaining most of the data entropy by means of compacting the information in very few features (often in just one or two). The proposed method produces features which have higher expressive power. In particular…

FOS: Computer and information sciencesComputer Networks and CommunicationsKernel density estimationMachine Learning (stat.ML)02 engineering and technologyKernel principal component analysisMachine Learning (cs.LG)Artificial IntelligencePolynomial kernelStatistics - Machine Learning0202 electrical engineering electronic engineering information engineeringMathematicsbusiness.industry020206 networking & telecommunicationsPattern recognitionComputer Science ApplicationsComputer Science - LearningKernel methodKernel embedding of distributionsVariable kernel density estimationRadial basis function kernelKernel smoother020201 artificial intelligence & image processingArtificial intelligencebusinessSoftwareIEEE Transactions on Neural Networks and Learning Systems
researchProduct

Simplifying Probabilistic Expressions in Causal Inference

2018

Obtaining a non-parametric expression for an interventional distribution is one of the most fundamental tasks in causal inference. Such an expression can be obtained for an identifiable causal effect by an algorithm or by manual application of do-calculus. Often we are left with a complicated expression which can lead to biased or inefficient estimates when missing data or measurement errors are involved. We present an automatic simplification algorithm that seeks to eliminate symbolically unnecessary variables from these expressions by taking advantage of the structure of the underlying graphical model. Our method is applicable to all causal effect formulas and is readily available in the …

FOS: Computer and information sciencesComputer Science - Artificial Intelligencegraph theoryyksinkertaisuussimplificationgraphical modelMachine Learning (stat.ML)Machine Learning (cs.LG)Computer Science - Learningprobabilistic expressionArtificial Intelligence (cs.AI)Statistics - Machine Learningkausaliteettipiirrosmerkitcausal inferencegraafit
researchProduct